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Abstract 

Background: Excess alcohol consumption has serious adverse effects on health and violence-related harm. In the 
UK around 37% of men and 29% of women drink to excess and 20% and 13% report binge drinking. The potential 
impact on population health from a reduction in consumption is considerable. One proposed method to reduce 
consumption is to reduce availability through controls on alcohol outlet density. In this study we investigate the 
impact of a change in the density of alcohol outlets on alcohol consumption and alcohol-related harms to health 
in the community. 

Methods/Design: A natural experiment of the effect of change in outlet density between 2005-09, in Wales, UK; 
population 2.4 million aged 16 years and over. Data on outlets are held by the 22 local authorities in Wales under 
The Licensing Act 2003. 

The study outcomes are change in (1) alcohol consumption using data from annual Welsh Health Surveys, (2) 
alcohol-related hospital admissions using the Patient Episode Database for Wales, (3) Accident & Emergency 
department attendances between midnight-6am, and (4) alcohol-related violent crime against the person, 
using Police data. 

The data will be anonymously record-linked within the Secure Anonymised Information Linkage Databank at 
individual and 2001 Census Lower Super Output Area levels. New methods of network analysis will be used to 
estimate outlet density. Longitudinal statistical analysis will use (1) multilevel ordinal models of consumption and 
logistic models of admissions and Accident & Emergency attendance as a function of change in individual outlet 
exposure, adjusting for confounding variables, and (2) spatial models of the change in counts/rates of each 
outcome measure and outlet density. We will assess the impact on health inequalities and will correct for 
population migration. 

Discussion: This inter-disciplinary study requires expertise in epidemiology and public health, health informatics, 
medical statistics, geographical information science, and research into alcohol-related violence. Information 
governance requirements for the use of record-linked data have been approved together with formal data access 
agreements for the use of the Welsh Health Survey and Police data. 

The dissemination strategy will include policy makers in national and local government. Public engagement will be 
through the Clinical Research Collaboration-Cymru "Involving People" network, which will provide input into the 
implementation of the research. 

Keywords: Alcohol, Outlet density, Alcohol-related harm, Anonymised record-linkage, Multilevel analysis, 
Spatial analysis 
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Background 

Excess alcohol consumption has many adverse effects on 
health including liver cirrhosis [1], cancer of the oral 
cavity, pharynx, larynx, oesophagus, liver [2] and breast 
[3], high blood pressure [4] and stroke [5]. There is also 
an increased risk of harm resulting from violence includ- 
ing homicide [6], suicide [7], road traffic accidents [8], 
domestic violence [9], and attendance at Emergency 
Departments and Minor Injuries Units for treatment of 
violence-related injuries [10]. 

Binge drinking is typically defined as consuming 
double the guideline limits in a single day during the 
previous week [11]. It is an increasing problem, particu- 
larly rising in young women [12], and is associated with 
anti-social behaviour [13] and around one-half of all vio- 
lent crimes in the UK [14,15]. Binge drinking places an 
acute burden on the NHS as around 20-40% of all 
people presenting to accident and emergency depart- 
ments are intoxicated, increasing to 80% after 12am 
[13]. Epidemiological evidence also suggests that binge 
drinking may reduce the benefits of moderate alcohol 
consumption on coronary heart disease risk [16,17]. 

Recent data show that around 37% of men and 29% of 
women exceed current UK guidelines for safe levels of 
alcohol consumption (defined in women as > 3 units 
per day; men > 4 units per day) and 20% of men and 
13% of women binge drink (defined in women as > 6 
units per day; men > 8 units per day) [18]. Given the 
wide range of harm resulting from this substantial level 
of excess consumption, the potential impact on health at 
the population level from a reduction in consumption is 
considerable. One of the principal evidence-based pol- 
icies recently recommended by the British Medical As- 
sociation to reduce alcohol consumption is to reduce 
easy access to alcohol through controls on hours of sale 
and outlet density [19,20]. In this study we focus on al- 
cohol availability measured as outlet density in response 
to the National Institute for Health Research (NIHR) 
commissioning brief PHR no. 09/3007. 

Outlet density and alcohol-related harm 

A recent systematic review of 44 cross-sectional studies 
found that the most frequently investigated alcohol- 
related harm was violent crime, especially assault [21]. 
The majority of studies (36/44) were from the USA. 
High outlet densities were associated with a higher rate 
of assault, self-reported injuries, car crashes, and high 
rates of pedestrian collisions. Higher outlet density was 
also shown to be associated with increased domestic vio- 
lence and child abuse. 

Very few studies in the review investigated associations 
between outlet density and non-injury health outcomes; 
two studies suggested high outlet densities were asso- 
ciated with an increased rate of sexually transmitted 



infections (STI) and two out of three studies found an 
association with high levels of consumption [21]. Of two 
additional American studies not included in the review, 
one found an association between higher outlet density 
and increased alcohol-related hospital admissions in San 
Diego County, USA [22], and the second study, set in 
California and Louisiana, found an association with 
increased self-reported liver disease, STI, and violence 
[23]. 

Change in outlet density and alcohol-related harm 

A recent review article, which included papers published 
up to 2006, summarised the effects of a change in outlet 
density resulting from three groups of studies: inter- 
rupted time series; natural experiments from alcohol 
bans, and changes in licensing arrangements. None were 
set in the UK. Many of these studies were old and few 
accounted for spatial autocorrelation in their analyses 
(i.e. the lack of independence of proximal small-areas). 
This review did not include any methodological assessment 
of the measurement of outiet density. The overall conclu- 
sion was that an increase in outlet density was associated 
with increased consumption and inter-personal violence, 
but the evidence was weaker for an association with motor 
vehicle crashes [24]. 

A further ten studies have investigated associations be- 
tween change in outlet density and a limited range of 
outcomes, namely violence [25-31], STI's [32,33] and 
suicide [34]. 

A six-year longitudinal study of change in outlet dens- 
ity set in California found that increases in bar and off- 
licensed outlet densities were positively related with an 
increase in the rate of violence, defined as a hospital ad- 
mission with an overnight stay with ICD-9 'E-code' 
E960-969 for injuries resulting from inter-personal vio- 
lence, which are not alcohol-specific [25]. Using the 
same dataset a second paper reported that change in 
each measure of outlet density was positively related to 
likely alcohol-related accident rates, using Police- 
recorded crashes judged to include alcohol, and hospital 
admissions resulting from motor vehicle crashes identi- 
fied from ICD-9 codes E810-E825 (which again are not 
alcohol-specific) [26]. A smaller study of alcohol-related 
crashes two years before and after closure of drive-up al- 
cohol outlets in New Mexico reported a non-significant 
association [27]. 

A time series analysis of Norwegian data from 1960 to 
1995 found outlet density was significantly associated 
with violent crime [28]. A study set in Melbourne found 
that an increase in the number of outlets over a nine- 
year period was positively related to an increase in num- 
bers of violent assaults recorded by the Police as taking 
place between 8pm and 6am (used as a proxy for 
alcohol-related assault for which specific data were not 
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recorded) [29]. The fixed effects analyses, which con- 
trolled for spatial autocorrelation, modelled numbers of 
assaults and outlets rather than rates or density. This 
analysis further categorised postcodes into five clusters, 
based on socio-demographic factors, to investigate more 
detailed relationships and found the strongest associa- 
tions in central and inner-city suburbs. 

A study set in Los Angeles reported that a reduction 
in alcohol availability within census tracts (resulting 
from a riot in 1992) was associated with a significant re- 
duction in gonococcal infection rates [32], partially 
mediated by neighbourhood social capital [33]. A signifi- 
cant reduction was also found for violent assaults [30] 
but a more sophisticated statistical analysis published 
one year later suggested that the association reported for 
violent assaults was no longer significant after using a 
different statistical modelling method [31]. The tenth 
study, using the same Californian dataset, found that 
changes in bar and off-premise outlet densities were 
positively related, and restaurant densities negatively 
related, to attempted and completed suicide rates [34]. 

Outlet density, neighbourhood deprivation and 
population migration 

Two US papers investigated the association between 
outlet density and deprivation. Higher outlet densities 
were found in more deprived areas but paradoxically 
levels of consumption were highest in less deprived 
areas, suggesting that the mismatch between supply and 
demand could result in alcohol-related harm being dis- 
proportionately higher in people living in deprived 
neighbourhoods in proximity to alcohol outlets [35,36]. 
A cross-sectional study set in Glasgow, UK, found that 
the spatial association between outlet density and 
deprivation did not vary systematically, highlighting the 
importance of local context in the design of future stud- 
ies [37]. Although it is clear from published research 
that selective population migration may account for a 
spurious widening of health inequalities over time 
[38,39], there are no studies which explicitly investigate 
this possibility for alcohol-related harm outcomes. 

Measurement of outlet density 

In the longitudinal studies, outlet density was estimated 
simply as the number of outlets per resident population 
in seven of the eight studies which estimated a density 
[25,27-30,33,34]. One study modelled the number of 
outlets with the resident population as an explanatory 
variable [29]. Three studies estimated density using the 
number of road-miles [30-32]. One of these also used 
the geographical area as the denominator and reported 
no difference between the methods [30]. 

Although most cross-sectional papers used resident 
population as the denominator, one study set in 



California noted that there is no standard for measure- 
ment of outlet density and estimated outlet density using 
several different measures for the denominator, includ- 
ing resident population, geographical area (e.g. square 
mileage), and other more spatial measures of exposure 
to outlets including Euclidean distance from home to 
nearest outlet, and number of outlets per 0.5 mile 'buffer 
zone' i.e. the geographical area defined by 10-15 min- 
utes walking time [35]. In this analysis there was little 
difference in the modelled associations between all of 
these measures. A New Zealand study used 10 minutes 
of travelling time by car to define a neighbourhood area, 
with their density measure as the number of outlets 
within each neighbourhood [36]. The only published UK 
research, from Glasgow, estimated outlet density using 
two different measures: outlets per 1000 residents and 
the mean network distance from the centroid of each 
small-area of interest to the nearest outlet [37]. 

Primary research question 

Overall, there is cross-sectional evidence from mainly 
American studies to suggest that higher outlet density is 
associated with alcohol-related harm, particularly involv- 
ing violence. The evidence from longitudinal studies for 
associations between harm and a change in outlet dens- 
ity is less well established. Many methodological ques- 
tions remain over the best way to measure outlet density 
and how to model the data. Little is known on the effect 
of a change in outlet density on inequalities in alcohol- 
related health and the role of population migration. Our 
primary research question for this study is: 

What is the impact of a change in the density of alco- 
hol outlets on alcohol consumption and alcohol-related 
harms to health in the community? 

Our secondary research questions are: 

1. Does a health selection effect from population 
migration at small-area level explain any observed 
associations between outlet density and alcohol- 
related harm? 

2. What effect does change in outlet density have on 
population inequalities in alcohol-related health? 

Methods/Design 

Setting 

The setting is Wales, UK, total population 3 million; 
population 2.4 million aged 16 years and over (2001 
Census); with 22 local authorities (the administrative 
areas of local government). 

Design 

The intervention we propose to evaluate is the change 
in the availability of alcohol, measured as outlet density. 
This complex intervention takes the form of a natural 
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experiment as is common in public health contexts [40], 
for example in changes to policy on the availability of al- 
cohol. These are not amenable to randomisation but 
have a natural non-random variability in geographical lo- 
cation and time. Different local authorities have different 
policies and we will obtain a comprehensive list of these 
policies and their variations, for example a specific pol- 
icy for handling applications for licences in areas where 
there is a danger of saturation. These variations in local 
policy will be associated with increases and decreases in 
outlet densities and we plan to contrast the effects of 
different local policies in our exposure measure and 
hence in the study outcomes. 
We propose four studies: 

Study 1: Primary outcome: alcohol consumption. 

A longitudinal analysis of change in levels of alcohol 
consumption using data from the 75 000 people aged 
16 years and over who anonymously responded to the 
five annual Welsh Health Surveys carried out between 
2005 and 2009 [41]. Consumption will be categorised as 
an ordinal scale of no alcohol, below sensible limit, 
above limit, and binge drinking using the Department of 
Health definition 2007 [42]. 

Study 2 Secondary outcome: hospital admissions. 

A longitudinal analysis of five years of hospital in- 
patient data for residents of Wales aged 16 years and 
over using the Patient Episode Database for Wales 
(PEDW), 2005-09. This includes hospital admissions for 
alcohol-related causes (around 13 000/year) [43]. 

Study 3 Secondary outcome: Accident & Emergency 
attendances. 

A longitudinal analysis of residents of Wales aged 
16 years and over attending an A&E department at night 
(midnight-6am, around 100 000/year) as a proxy for 
alcohol-related injury between 2009 and 2011. 

Study 4 Secondary outcome: alcohol-related violent 
crime. 

A longitudinal analysis of five years of Police data on 
alcohol-related violent crime against the person in 
Wales, 2005 to 2009 (around 47 000/year). 

Data sources 

Use of existing record-linked datasets: the Secure 
Anonymised Information Linkage databank 

The Secure Anonymised Information Linkage (SAIL) 
databank, held and managed within the Health Informa- 
tion Research Unit (HIRU) at Swansea University, con- 
tains health, social and education data on three million 
residents of Wales and currently includes thirteen data- 
sets containing nearly one billion records [44,45]. The 
smallest geographical area for which data are already 
linked and may be released is the 2001 Census Lower 
Super Output Area (LSOA) [46]. These areas vary in size 
but contain an average of 1500 people, with the 



constraint of a minimum population of 1000. There are 
1896 LSOAs in Wales. 

Welsh Demographic Service (WDS) 

The WDS dataset held by NHS Wales Informatics Ser- 
vice (NWIS), the NHS organisation in Wales mandated 
to hold personally identifiable data, contains addresses 
for all individuals who register with a General Practi- 
tioner. Dates for each update of the address record are 
held, thereby providing durations of residency for several 
different homes and the ability to link to local environ- 
ment exposures at each. Demographic data may be used 
to create population sub-groups based on age, gender 
and location for the required date or duration. The 
WDS contains address information linked anonymously 
at the individual level (the anonymised linking field, 
ALF) which is the primary key variable for record- 
linkage. Using a split-file technique, NWIS supplies 
ALFs for the whole population of Wales to the SAIL 
databank [44,45]. 

Residential Anonymised Linking Fields 

A particular strength of the SAIL system is the develop- 
ment of the system for anonymising all households in 
Wales and linking household-level data from local au- 
thorities and others with individual health-related data 
whilst protecting anonymity, using Individual Anonymised 
Linking Fields (ALF) and Residential Anonymised 
Linking Fields (RALF). Address data are matched at 
NWIS where identifiable addresses are replaced with 
RALFs [47]. The residence-based metrics are then 
fully incorporated into the SAIL databank by linking 
RALFs to ALFs. 

Environment Geographic Information System 

In parallel, but separate from the SAIL databank, is the En- 
vironment Geographic Information System (eGIS). The 
eGIS contains a number of datasets obtained from Great 
Britain's mapping agency - the Ordnance Survey. These 
data include the Ordnance Survey Address Base Premium 
[48], which contains point data for all residences and busi- 
nesses, both past, present and future for Wales. The Ord- 
nance Survey MasterMap Integrated Transportation 
Network dataset [49] provides detailed road and footpath 
information and the Ordnance Survey Points of Interest 
database contains places that can be visited, categorised by 
types, such as pubs, clubs or restaurants [50]. These data- 
sets have typically been surveyed to a spatial accuracy 
of ± l-2m, providing high resolution for spatial analysis. 

Alcohol outlets 

There are 22 local authorities in Wales who are respon- 
sible under The Licensing Act 2003 for maintaining pub- 
lic registers of Premises and Personal Licence holders, 
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available annually from 2005. Under The Act these data 
are available for public inspection in council offices. 
Each authority keeps a database (electronic or hard-copy 
) of the name and address of each premise with the full 
unit postcode. All 22 local authorities have agreed to 
supply the data for this study although some authorities 
do not distinguish between on- and off-sales. 

Using the postcode we can georeference each outlet to 
within any geographical boundary using standard GIS 
techniques. These postcodes have been validated by the 
local authorities and we will check each postcode to the 
address using the Ordnance Survey Address Base Pre- 
mium [48]. In this study the basic geographical unit for 
small-area analyses will be the 2001 Census LSOA. 

Estimation of outlet density 

Our literature review found four main approaches to es- 
timate outlet density: (1) number of outlets per capita in 
an administrative unit or geographical area, (2) number 
of outlets per network miles per geographical area, (3) 
number of outlets per geographical area defined by walk- 
ing or drive time (a 'buffer zone'), and (4) closest Eucli- 
dian distance from each subject's home to an outlet. 

We wish to estimate a new measure of outlet density 
which includes a more realistic and robust measure of 
accessibility than previously published methods. We 
will estimate a summary statistic for distance to all 
outlets from each residence within a defined buffer 
zone. The measures of distance calculated will be net- 
work distances and travel times which will more ac- 
curately measure exposure in comparison to Euclidian 
distances. 

We therefore propose a spatial methodology to esti- 
mate outlet density using the unique advantages of the 
WDS population data within the SAIL databank: 

(1) We will calculate the sum (or mean) network 
distance for each individual to each outlet within a 
meaningful network range (10 minutes walk and 
10 minutes drive-time) of each residence (RALF), 
using the service area tool in ArcGIS Network 
Analyst module, together with Ordnance Survey 
MasterMap data and point location facility data in 
the eGIS [49,50]. The individual-level measure of 
outlet density exposure will be a function (e.g. 
reciprocal) of the total (mean) distance; 

(2) We will derive the LSOA density by averaging the 
values for all individual residents in the LSOA. 

The primary analysis will be for all alcohol outlets but 
we will also estimate outlet densities for on and off-sales 
separately for those authorities who can supply this 
categorisation. 



Primary outcome: Welsh Health Survey data 

Data on consumption will be used from the five annual 
consecutive Welsh Health Surveys 2005-09 [41]. Partici- 
pants were asked to state the highest number of units 
they had drunk on any one day in the previous seven 
days, using a standard prompt to convert different types 
and quantities of alcoholic drinks into units (2005-07) 
or by reporting measures of consumption translated into 
units by the dataset coders (2008-09). The classification 
of units into ordinal categories of consumption used in 
the Welsh Health Survey dataset is based on the Depart- 
ment of Health definitions (Table 1) [42]. 

Secondary outcomes 
Hospital admissions 

The SAIL databank includes the complete PEDW data- 
base, which records all inpatient and day case admis- 
sions for residents of Wales. The pseudonymisation 
process results in the encryption of a unique, unidentifi- 
able ID that enables a patient-based analysis rather than 
an admissions -based analysis to be carried out. Each 
hospital spell has an attached ICD-10 diagnosis code in 
one of 14 positions: primary, subsidiary, and 3 rd to 14th 
position. The Office for National Statistics has published 
a list of ICD-10 codes to define causes of death which 
are exclusively alcohol-related i.e. not just where alcohol 
could be a contributory aetiological factor [51]. There 
are several other published classifications of 'exclusively' 
alcohol-related conditions with considerable overlap be- 
tween them [52]. The differences from the ONS defin- 
ition largely relate to non-fatal conditions excluded from 
the ONS cause of death list. In addition are the external 
causes of morbidity and mortality codes for motor ve- 
hicle crashes and injuries resulting from inter-personal 
violence. However these codes are not alcohol-specific. 
In the analysis we will define alcohol-related as the com- 
bined causes of death and condition codes and we will 
perform secondary analyses including ICD-10 'External' 

Table 1 Categorisation of alcohol consumption in the 
Welsh Health Surveys 2005-09 



Category Maximum units drunk on 

any day in the last week 



None 


Did not drink in the last seven days 


Sensible 


Men drinking no more than 4 units, 




women no more 




than 3 units 


Above guidelines, 


Men drinking more than 4 and up to 


less than binge 


and including 8 units, women more 




than 3 and up to and including 6 units 


Binge 


Men drinking more than 8 units, 




women more than 6 units 



Source: References [42]. 
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codes for the motor vehicle crashes and injuries result- 
ing from inter-personal violence. 

Each PEDW record is linked to the LSOA of residence 
to which a deprivation score, such as a quintile, of the 
Welsh Index of Multiple Deprivation 2005 [53], has been 
attached in SAIL. We can then attribute admissions to 
an LSOA and deprivation quintile for the analysis. 

Accident & emergency department data 

The Accident & Emergency Department Data Set 
(EDDS) is a new dataset within HIRU/SAIL and the first 
wave of data for the 13 Accident & Emergency (A&E) 
Departments in Wales for 2009 has been cleaned and 
validated. Data for subsequent years are entered into 
SAIL as it is received, currently on a monthly basis. 
Attendances are recorded for all injuries with the date 
and time of day. There is no definitive uniformly applied 
code for alcohol-related attendances in this dataset and 
so we will use attendance at night (midnight to 6am) as 
a proxy for alcohol-related as the majority of injury 
attendances at night (particularly aged 16-39) are likely 
to be alcohol related. These attendances will be linked to 
RALFs and hence to anonymised network measures for 
the analysis. Using ALFs/RALFs we will also trace atten- 
dees who are subsequently admitted to hospital (by link- 
ing to PEDW) and exclude all those who do not have an 
alcohol-related ICD-10 code. 

The numbers are sufficient for a spatial analysis: there 
were 101 908 night time attendances at the 13 A&E 
departments in Wales in the financial year 2009/10. The 
highest numbers were on Friday to Saturday and Satur- 
day to Sunday nights (15 862, 17 098) compared to week- 
days (weekday range 13 374 to 14 232; weekly mean 14 
558, standard deviation 1386). 

Police data 

Police forces record electronically all incidents reported 
to them. Incidents of violence against the person are 
detailed according to incident type (e.g. assault occasion- 
ing actual bodily harm), the injury incurred, incident lo- 
cation and alcohol use. These incidents are recorded at 
the premises, address or street location level. Eastings 
and Northings locations are recorded, facilitating geore- 
ferencing into LSOAs, although the accuracy of these 
coordinates has not been evaluated. Data sharing agree- 
ments between Cardiff University and the four police 
forces in Wales are now in place. Our work with police 
data on an exploratory trial of a disorder reduction 
intervention found the data to be sufficiently robust for 
research purposes [54]. Although a proportion of violent 
incidents are not reported to the police, these data re- 
main the most detailed and accurate records of violent 
incidents at the premises or street level. 



We conducted pilot work using South Wales Police 
data to establish the feasibility of using police recorded 
incident data for the period of the proposed study, 
2005-09. Approximately 42% of incidents were recorded 
as involving alcohol on the part of the offender or the 
victim. While this rate is slightly below the 50% preva- 
lence among incidents recorded by the British Crime 
Survey [55] we believe that these data are of sufficient 
quality for our study. Furthermore, the recording of inci- 
dent type and presence of injury allows the study to ex- 
plore the role of outlet density for both occurrence and 
severity. The Police data aggregated at LSOA-level will 
be added to SAIL for this study. 

Proposed sample size 

To investigate the magnitude of change in outlets over 
time, in a pilot exercise we have assessed the outlet data 
on licensed premises supplied by four local authorities, 
with a total of 273 LSOAs (mean 13 outlets). The mean 
five-year change for local authorities was 6% (range 5% 
to 46%) which exceeds the 4% change in outlets over a 
six-year period reported in the study of change in outlet 
density which gave sufficient power to detect change in 
a rare (suicide) outcome [34] and the -0.4% mean 
change in outlets over a six-year period reported in a 
second longitudinal study [26]. Thus we expect to have 
sufficient change in outlet density for the analysis. 

For the primary outcome, based on data from five 
Welsh Health Surveys, the sample size is approximately 
75 000. Because we do not yet have the data on the loca- 
tion of alcohol outlets by year, we do not know the dis- 
tribution of changes and so cannot complete a 
conventional sample size calculation. However we previ- 
ously carried out an unpublished analysis of data of ap- 
proximately 60000 respondents to four successive 
Welsh Health Surveys 2005-2008 and can draw on that 
experience. In similar multilevel models to those antici- 
pated here, for the primary outcome analysis, standard 
errors were typically between 0.04 and 0.06, varying be- 
tween model parameters. These led to narrow confi- 
dence intervals for parameter values; for example the 
estimated odds ratio associated with a quintile of 
deprivation was 0.82 with 95% confidence interval (0.76, 
0.88). So with a larger sample size in this study the pre- 
cision will be greater and smaller differences in out- 
comes will be detected. 

For the other outcomes the study population is essen- 
tially the 2.4 million people living in Wales aged at least 
16 years. The outcomes are not rare with approximately 
13 000 alcohol-related admissions, 100000 night time 
A&E attendances, and 47 000 violence against the person 
crimes in Wales annually. We will have very high power 
for detecting small differences between LSOAs at the 
extremes of changes in alcohol outlet density. For 
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example there would be over 80% power for identifying 
a reduction from 5.4 per thousand per annum to 5 per 
1000 per annum in hospital admissions between the ex- 
treme quintiles of change in outlet density. While this 
ignores clustering within LSOA, that effect is unlikely to 
be so large that the power is greatly reduced and there 
will be more power for detecting a larger difference. If 
the correction factor for clustering was two, a common 
finding in studies of this type, the required sample size 
would effectively be doubled. However with the planned 
sample size there would still be over 80% power for 
detecting a reduction from 5.4 per thousand per annum 
to 4.8 per thousand per annum in the admission rate for 
alcohol-related conditions. 

Statistical analysis 

The primary analysis will be for all alcohol outlets but 
we will also analyse the data using outlet densities for 
on- and off-sales separately where available. Our analysis 
plan has to take into account the nature of the data 
available through SAIL. Data on individuals nested 
within households nested within anonymised LSOAs will 
be available. Our analysis plan is: 

(1) Full descriptive statistics. 

(2) Spatial models to explore the relationship between 
counts/rates of the four outcome measures and 
outlet density. The models will account for the year 
and will adjust for aggregated population 
characteristics, such as the percentages of males 
and of certain socio-economic groups, and the 
median age. There are a number of methodologies 
that could be adapted to calculate spatial 
autocorrelation, including the Bayesian approach of 
Besag, York and Mollie [56], the random effects 
models developed by Gruenewald and colleagues 
and used in the three Californian longitudinal 
studies [25,26,34] and geographically weighted 
regression (GWR) models [57-60]. Moreover, more 
complex methodologies which take into account 
network distances - the distance to an outlet based 
on road and footpath measurements - can be 
adapted to explore these relationships. These 
methodologies include network distance 
calculations [61], the two-step floating catchment 
technique [62-64], and modified GWR models [65]. 
The different types of spatial models make different 
assumptions, model spatial autocorrelation 
differently and analyse the data in different ways; 
their relative merits will be explored as part of the 
study. 

(3) A multilevel analysis using the hierarchy outlined 
above. We will use multilevel ordinal models of the 
defined categories of consumption for the WHS 



data and multilevel logistic models of the odds of 
hospital admission and A&E attendance as a 
function of change in individual outlet exposure, 
adjusting for confounding individual variables (age, 
gender, socio-economic status) and LSOA 
deprivation quintile. This method cannot be used 
for the crime data as we only know the location of 
offence with no information on individuals. 
Multilevel modelling is a well-established 
technique, see for example Goldstein 2003 [66], and 
we will fit models using the standard MLwiN [67] 
and R software [68]. 

In order to address the secondary research questions: 

(4) We will assess the impact on health inequalities by 
firstly stratifying the spatial and multilevel models 
using quintiles of the Welsh Index of Multiple 
Deprivation to examine whether the modelled 
associations vary with deprivation category. This 
method was used for postcode sectors in the 
Australian longitudinal analysis [29] . A further 
method which we will use is to model the 
interaction between LSOA deprivation score and 
change in outlet density and the two-way 
interaction including individual SES (using the NS- 
SEC) [69]. This method uses the whole dataset and 
the coefficients of the interaction terms give 
information on the effect on inequalities over time. 

(5) Since the annual estimation of outlet density 
exposure automatically includes people who move 
residence, (from using the WDS within SAIL), our 
main models will correct for population migration. 
We will assess the impact of population migration 
by re-estimating annual outlet densities using the 
baseline population, repeating the models above, 
and comparing to the main models. Health 
selection bias can thus be assessed. 

Presentation and mapping of the results 

An important element of the research will be mapping of 
the data. This will form part of the substantive investigation 
as well as part of the presentation and communication of 
the results. It is well established that mapping health and 
related social and economic data can provide important 
insights into the causal processes that affect health out- 
comes [70]. Global and local statistical measures of spatial 
clusters can be misleading without mapping [71,72], which 
provides an excellent method of revealing potential anom- 
alies in the data. The research will therefore use mapping 
techniques in the specification and testing of the alcohol 
outlet density variable and the spatial model. 

Mapping will also be used as part of the diagnostic 
tests and interpretation of the multilevel models, 
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principally through mapping the areal residuals. This is 
an important element of model building [73] and can be 
used to refine the specification of the model as well as 
identify structure not accounted for by the predictors. 
For instance, systematic variation in the size and direc- 
tion of residuals across space can be indicative of spatial 
autocorrelation and/or spatial heterogeneity caused by 
mis-specification of the model. Mapping the residuals 
and analysing them using spatial statistical techniques 
such as local indicators of spatial autocorrelation can 
prove invaluable in model building. Moreover, mapping 
will allow a visual analysis of the spatial correlation be- 
tween the residuals and the location of the alcohol out- 
lets. A clear relation, such as concentrations of large 
positive/negative residuals around clusters of alcohol 
outlets, could indicate social processes unexplained by 
the model and lead to further analysis. 

Discussion 

This study requires a wide range of inter-disciplinary ex- 
pertise, including epidemiology and public health, health 
informatics, medical statistics, geographical information 
science and health geography, city and regional planning 
and urban spatial analysis, systematic reviews, and vio- 
lence and society alcohol research. The study is overseen 
by an inter-disciplinary and multi-agency study steering 
committee (SSC), constituted to include the principal in- 
vestigator and one co-applicant from Cardiff and Swan- 
sea University, the hinder, the study sponsor (Cardiff 
University), with a representative from South Wales Po- 
lice, the Cardiff Community Safety Partnership, the Clin- 
ical Research Collaboration-Cymru 'Involving People' 
network and experts in the field of alcohol and violence 
research, and alcohol policy. 

Research governance and ethical arrangements 

The information governance requirements for use of the 
Secure Anonymised Information Linkage (SAIL) dataset 
in this study have been approved by the Independent In- 
formation Governance Review Panel (IGRP), with mem- 
bership comprised of senior representatives from the 
British Medical Association (BMA), the National Re- 
search Ethics Service (NRES), Public Health Wales, NHS 
Wales Informatics Service (NWIS), and Involving 
People. 

Participant consent is not required because all the 
SAIL study outcome data on patients (hospital admis- 
sions and A&E attendances) will be anonymised before 
they are incorporated into the SAIL databank. As the 
SAIL databank is fully anonymised, it does not fall into 
the remit of the National Information Governance Board 
who provide section 251 (formerly section 60) exemp- 
tion to use identifiable data without consent. 



The anonymised Welsh Health Survey dataset is sup- 
plied under a formal Data Access Agreement between 
Cardiff University and the Welsh Government. Welsh 
Health Survey data do not have permission from the 
Data Owner (Welsh Government) to link into SAIL at 
individual-level but can be linked, using data aggregated 
by LSOA, to outlet density exposures. Police data are 
also supplied under Data Access Agreements between 
each Police Force in Wales and Cardiff University and 
can also only be linked at aggregate LSOA-level. 

The Chairman of the Research Ethics Committee for 
Wales has confirmed in writing that this study does not 
require ethical approval under NHS research governance 
arrangements. 

Service users/public involvement 

The scope for patient involvement is limited as we will 
carry out a secondary analysis of anonymously record- 
linked datasets. The primary vehicle in Wales for public 
engagement, with whom we will work closely, is the In- 
volving People network, which aims to provide input 
into the strategy, development and implementation of 
health and social care research in Wales. 

Dissemination 

A dissemination strategy is planned to include policy 
makers in national and local government and the re- 
search community. Maps will also form an important 
part of the presentation and communication of the 
results to different audiences. Visual information and 
maps are able to present a large amount of complex in- 
formation in an efficient manner and will be able to 
visually demonstrate any spatial correlations found be- 
tween alcohol outlets and the study outcomes. 
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